Automatic pronunciation scoring using learning to rank and DP-based score segmentation

Authors

  • Liang-Yu Chen
  • Jyh-Shing Roger Jang
Abstract

This paper proposes a novel automatic pronunciation scoring framework using learning to rank. Human scores of the utterances are treated as ranks and serve as the ranking ground truth. Scores generated by various existing scoring methods are used as features to train the learning-to-rank function. The output of the function is then segmented by the proposed DP-based method, and the boundaries between the resulting clusters determine the discrete computer scores. Experimental results show that the proposed framework improves upon the existing scoring methods. A non-native corpus with human ranks is also released.
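
The abstract does not state the objective that the DP-based segmentation optimizes. As a rough illustration only, the sketch below assumes the common choice of splitting the sorted one-dimensional ranker outputs into k contiguous clusters that minimize the within-cluster sum of squared errors, then reading the discrete score off the cluster index. The function name segment_scores, the midpoint boundaries, and the squared-error cost are assumptions for illustration, not details taken from the paper.

```python
from typing import List, Tuple


def segment_scores(values: List[float], k: int) -> Tuple[List[float], List[int]]:
    """Split 1-D values into k contiguous clusters (in sorted order) by DP.

    Assumed cost: within-cluster sum of squared errors (not from the paper).
    Returns (boundaries, labels): k-1 midpoint boundaries and, for every
    input value in its original position, a discrete score in 1..k.
    """
    n = len(values)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and len(values)")

    # Sort once, remembering where each value came from.
    order = sorted(range(n), key=lambda i: values[i])
    xs = [values[i] for i in order]

    # Prefix sums give the squared error of any slice xs[i:j] in O(1).
    ps = [0.0] * (n + 1)
    ps2 = [0.0] * (n + 1)
    for i, x in enumerate(xs):
        ps[i + 1] = ps[i] + x
        ps2[i + 1] = ps2[i] + x * x

    def sse(i: int, j: int) -> float:
        s = ps[j] - ps[i]
        return (ps2[j] - ps2[i]) - s * s / (j - i)

    inf = float("inf")
    # cost[c][j]: best cost of splitting xs[:j] into c clusters.
    cost = [[inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    cost[0][0] = 0.0
    for c in range(1, k + 1):
        for j in range(c, n + 1):
            for i in range(c - 1, j):
                cand = cost[c - 1][i] + sse(i, j)
                if cand < cost[c][j]:
                    cost[c][j] = cand
                    back[c][j] = i

    # Backtrack to recover the start index of every cluster.
    starts = []
    j = n
    for c in range(k, 0, -1):
        j = back[c][j]
        starts.append(j)
    starts.reverse()

    # A boundary is the midpoint between neighbouring clusters.
    boundaries = [(xs[s - 1] + xs[s]) / 2 for s in starts[1:]]

    labels = [0] * n
    for rank, start in enumerate(starts):
        end = starts[rank + 1] if rank + 1 < k else n
        for p in range(start, end):
            labels[order[p]] = rank + 1
    return boundaries, labels
```

For example, segment_scores([0.1, 0.2, 0.15, 2.0, 2.1, 5.0, 5.2], 3) assigns the first three values to score 1, the next two to score 2, and the last two to score 3. This triple loop runs in O(k n^2) time, which is adequate for a few thousand utterances; the framework described in the abstract may use a different cost or a different way of placing boundaries.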

Similar resources

Improvement in Automatic Pronunciation Scoring using Additional Basic Scores and Learning to Rank

This paper proposes the adoption of different word-level scores in the framework of automatic pronunciation scoring using learning to rank. Six types of phone-level scores are first computed and converted to word-level scores by using average-based, vowel-based, and consonant-based methods. Different score combination methods are then used to combine these word-level scores to obtain the final ...

A multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images

The reconstruction of the information contaminated by cloud and cloud shadow is an important step in pre-processing of high-resolution satellite images. The cloud and cloud shadow automatic segmentation could be the first step in the process of reconstructing the information contaminated by cloud and cloud shadow. This stage is a remarkable challenge due to the relatively inefficient performanc...

Neural Network-Based Learning Kernel for Automatic Segmentation of Multiple Sclerosis Lesions on Magnetic Resonance Images

Background: Multiple Sclerosis (MS) is a degenerative disease of central nervous system. MS patients have some dead tissues in their brains called MS lesions. MRI is an imaging technique sensitive to soft tissues such as brain that shows MS lesions as hyper-intense or hypo-intense signals. Since manual segmentation of these lesions is a laborious and time consuming task, automatic segmentation ...

Automatic Mandarin pronunciation scoring for native learners with dialect accent

This paper studies pronunciation scoring algorithm in CALL system aiming at teaching native Chinese learn standard Mandarin. Most of the pronunciation scoring algorithms focus on non-native environment, which may not be suitable for native speakers. We bring up a new algorithm based on traditional posterior log-likelihood algorithm by weighting the initial part of Mandarin syllables, where fina...

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

Publication date: 2010